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Abstract. Consider the empirical spectral distribution of complex ran- 
dom nxn matrix whose entries are independent and identically distributed 
random variables with mean zero and variance 1/n. In this paper, via ap- 
plying potential theory in the complex plane and analyzing extreme singular 
I values, we prove that this distribution converges, with probability one, to 

PLh ■ the uniform distribution over the unit disk in the complex plane, i.e. the 

well known circular law, under the finite fourth moment assumption on 
matrix elements. 
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1. Introduction 



(N 

^ ! Let {Xkj}, k,j = ■ ■ ■ , be a double array of independent and identically 

I distributed (i.i.d.) complex random variables (r.v.'s) with EXu = and 

■ E'lXiip = 1. The complex eigenvalues of the matrix n~^/^X = n~^/^(X, 



are 



denoted by Ai, ■ ■ ■ , A„. The two-dimensional empirical spectral distribution 
^n{x,y) is defined as 

1 

(1.1) = - I{Re{Xk) < X, Im{\k) < y). 

k=l 

^ \ The study of /in(a;, y) is related to understanding the random behavior of slow 

■ neutron resonances in nuclear physics. See [TB]. Since 1950's it has been 

conjectured that, under the unit varaince condition, converges to the 

so-called circular law, i.e. the uniform distribution over the unit disk in the 
complex plane. Up to now, this conjecture is only proved in some partial cases. 

The first answer for complex normal matrices was given in [16j based on the 
joint density function of the eigenvalues of n~^/^X. Huang in yjj reported that 
this result was obtained in an unpublished paper of Silverstein (1984). After 
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more than one decade, Edelman [7j also showed that the expected empirical 
spectral distribution converges to the circular law for real normal matrices. It 
is Girko who investigated the circular law for general matrix with indepen- 
dent entries for the first time in [8]. But Girko imposed, not only moment 
conditions, but also strong smooth conditions on matrix entries. Later on, 
he further published a series of papers (for example, [9]) about this problem. 
However, as pointed out in [1] and [TOj, Girko's argument includes serious 
mathematical gaps. The rigorous argument of the conjecture was given by Bai 
in his 1997 celebrated paper pj for general random matrices. In addition to 
the finite (4 + e)th moment condition Bai still assumed that the joint density 
of the real and imaginary part of the entries is bounded. Again, the result was 
further improved by Bai and Silverstein under the assumption E\Xii\'^~^^ < oo 
in their comprehensive book [2J, but the finiteness condition of the density of 
matrix entries is still there. Recently, Gotze and Tikhomirov [IQl gave a proof 
of the convergence of EiJ,n{x,y) to the circular law under the strong moment 
assumption that the entries have sub-Gaussian tails or are sparsely non-zero 
instead of the condition about the density of the entries in [I]. 

Generally speaking, there are five approaches to studying the spectral distri- 
bution of random matrices. The difficulty of the circular conjecture is that the 
methodologies used in Hermitian matrices do not work well in non-Hermitian 
ones. There was no powerful tool to attack this conjecture. 

1. Moment method. Moments are very important characteristics of r.v.'s. 
They have many applications in probability and statistics. For example, we 
have moment estimators in statistics. As far as we know, it is Wigner [23] 
[23] who introduced moment method into random matrices. Since then, the 
moment method has been very successful in establishing the convergence of 
the empirical spectral distribution of Hermitian matrices. Bai did a lot of 
important work. One can refer to [2]- But moment method fails to work in 
non-Hermitian ones, because for any complex r.v. Z uniformly distributed 
over any disk centered at 0, one can verify that for any m > 1 



2. Stietjes transform. Another powerful tool in random matrices theory is 
the Stieltjes transform, which is defined by 



for any distribution function G{x). The basic property of Stieltjes transform is 
that it is a representing class of probability measures. This property offers one 
a strong analytic machine. Still see [2] and the references therein. However, the 
Stieltjes transform of n~^/^X is unbounded if z coincides with one eigenvalue. 
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So this leads to serious difficulties when dealing with the Stieltjes transform 



3. Orthogonal polynomials. The study of orthogonal polynomials goes back 
as far as Hermite. For the deep connections between orthogonal polynomials 
and random matrices, one can refer to [3]. Orthogonal polynomials are usually 
limited to Guassian random matrices. Moreover, orthogonal polynomials are 
only suitable to deriving the spacing between consecutive eigenvalues for large 
classes of random matrices (see [1]). 

4. Characteristic functions. There is a long history of characteristic func- 
tions. In 1810, Laplace used Fourier transform, i.e. characteristic functions to 
prove central limit theorem for bounded r.v.'s. Then in 1934 P. Levy reproved 
Linderberg central limit theorem by characteristic functions. From that time 
on, characteristic functions are well known to almost every mathematician. 
Surprisingly, one can not see any application of characteristic functions in ran- 
dom matrices until 1984. Girko combined together the characteristic function 
of fin{x,y) and the Stieltjes transform, trying to prove the conjecture in [8]. 
Developing ideas proposed by Girko [8j, Bai reduced the conjecture to esti- 
mating the smallest singular value of n~^/^X — 2:1 in [1]. However, one should 
note that some uniform estimate of the smallest singular values of n~^/^X — 2:1 
with respect to z will be required if the method in pLj is employed. 

5. Potential theory. Potential theory is the terminology given to the wide 
area of analysis encompassing such topics as harmonic and subharmonic func- 
tions, the boundary problem, harmonic measure. Green's function, potentials 
and capacity. Since Doob's famous book [5] appeared, it is widely accepted 
that potential theory and probability theory are closely related. For example, 
superharmonic functions correspond to supermartingales. 

The logarithmic potential of a measure /i (see [19]) is defined by 



where fi{t) is any positive finite Borel measure with support in a compact 
subset of the complex plane. There is also an inversion formula, i.e. fi can 
be defined through as dfi = —{2n)~^AU^, where A is the two dimensional 
Laplacian operator. This relation makes Khoruzhenko in [12] suggest to use 
potential theory to derive the circular law. Then Gotze and Tikhomirov in [10] 
used the logarithmic potential of Efi^ convoluted by a smooth distribution to 
provide a proof for the convergence of Efin to the circular law with entries 
being sub-Gaussian or sparsely non-zero. 

In this paper, the conjecture, the convergence of /i„(x, y) to the circular law 
with probabihty one, is established under the assumption that the underlying 



of n-i/2X. 



(1.3) 
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r.v.'s have finite fourth moment. Compared with [10], we work on the log- 
arithmic potential of directly, while [lU] depends on the logarithmic 
potential of a convolution of Efin{x,y) and the uniform distribution on the 
disk of radius r. 

The main result of this paper is formulated as follows. 

Theorem 1. Suppose that {Xj^} are i.i.d. complex r.v.'s with EXu = 0, 
E'lXiip = 1 and £"1X111^ < oo. Then, with probability one, the empirical 
spectral distribution function fin{x,y) converges to the uniform distribution 
over the unit disk in two dimensional space. 

Remark 1. The bounded density condition in fTJ and the sub- Gaussian as- 
sumption in JlU\ are not needed any more. 

Theorem [T] will be handled by potential theory in conjunction with estimates 
for the smallest singular value of ra^^^^X — zl. 

The research of the smallest singular values originates from von Neumann 
and his colleagues. They guessed that 

(1.4) s„(X) ~ with high probability, 

with s„(X) being the smallest singular value of X. Edelman in [B] proved it 
for random Gaussian matrices, i.e., for each e > 

(1.5) P(s„(X) < sn-'/^) ~ E. 

Rudelson and Vershynin in ILS\ solved it for real random matrices, i.e., for 
every 6 > there exist e > and Uq depending only on S and the fourth 
moment of Xjk so that 

(1.6) P(s„(X) < £n~^^^) < 5 for all n > uq. 

Moreover, since (11.51) fails to hold for the random sign matrices [Xjk being 
symmetric ±1 r.v.'s), Spielman and Teng [20j speculated that for random sign 
matrices for any e > 

(1.7) P(s„(X) < en-^/^) < e + c" < c < 1. 

Again, (I1.7P has been proved for real random matrices with i.i.d. subgaussian 
entries in |18j . 

We will adapt Rudelson and Vershynin's method to obtain the order of the 
smallest singular value for complex matrices perturbed by a constant matrix. 

Formally, let W = X + A„, where A„ is a fixed complex matrix and 
X = {Xjk), a random matrix. Denote the singular values of W by Si, ■ ■ ■ , s„ 
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arranged in the non-increasing order. Particularly, the smallest singular value 
is 

Sn(W) = inf l|Wx||2, 

xeC":|jx||2 = l 

where || ■ II2 means Euclidean norm, and we denote the spectral norm of a 
matrix by || ■ ||. 

Theorem 2. Let {Xjk} be i.i.d. complex r.v.'s with EXu = 0, = 1 

and E\Xii\^ < B. Let K > 1. Then for every e > 0, 

(1.8) P(s„(W) < en~'^^) < + c" + P(|| W|| > Kn'/^), 

where C > and c G (0, 1) depend only on K, B, E(^Re{Xii))^ , E (^Im{Xii)Y , 
and ERe{Xu)Im{Xu)- 

Remark 2. In Theorem\^ e is arbitrary. It can depend onn. K is a constant 
not smaller than 1 . In Section when we apply ( (i.<g|) in the proof of Theorem 
[21 we will select e = n^^^^, K > 4. 

Remark 3. Theorem\^ includes Theorem 5.1 in [18] as a special case, where 
A„ = 0, the r.v. 's are real and have finite fourth moment. Therefore, U.6\) is 
true with X replaced by W when Xu has finite fourth moment and ||A„|| < 
(C>0), I.e., 

P(sn(W) < en~^^^) < 6 for all n > uq. 

Moreover, if ^ is a subgaussian matrix and ||A„|| < C^Jn, by Lemma 2.4 of 
[18\ or Fact 2.4 of [H], [T77\ ) holds with X replaced by W, i.e., 

i^(sn(W) < en-^/^) <e + c" 0<c<l. 

This exponential rate is better than the polynomial rate in Tao and Vu |22j . 

Furthermore, for general random matrices, similar to steps fl3.3p - fl3.4l) in 
Section 3 one can conclude that 

Corollary 1. In addition to the assumptions of Theorem 0, suppose that 
^ y/n^n O'lT'd ||A„|| < C^/n with < C < 00, then for any e > 

(1.9) P(s„(W) < en-^/2) <Ce + n~\ 

where I is any positive number and — > with the convergence rate slower 
than any preassigned one as n (yo. 

Remark 4. Taking e = 0, Corollary [I] then leads to a polynomial bound for 
the singularity probability: 

P(Wn is singalur) < n~\ 

with I being any positive number. 
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Remark 5. For random sign matrices Tao and Vu [21] showed that for every 
A > there exists B > so that 

Recently, Tao and Vu [22j reported a result concerning the smallest singular 
value of a perturbed matrix too. Under some mild conditions, they proved that 

Compared with their results, U.9\) gives an explicit dependence between the 
bound on s„(W) and probability, while the relationship between A and B in 
[21] and [22] is implicit. In addition, U.9\) holds for general random matrices, 
while Tao and Vu's theorem basically applies to discrete random matrices. 

Remark 6. In this paper, we will use the letters B,Ki,K2 to denote some 
finite absolute constants. 

The argument of Theorem [2] is presented in the next section and the proof 
of the circular law is given in the last section. 

2. Smallest singular value 

In this section the smallest singular value of the matrix X perturbed by a 
constant matrix will be characterized. We begin first with the estimation of 
the so-called small ball probability. 

2.1. Small ball probability. The small ball probability is defined as 

(2.1) Pe{h)=snpP{\Sn-v\<e), 

where 

n 

(2.2) Sn = Y,bk7]k 

k=l 

with ?7i, • • • ,r]n being i.i.d. r.v.'s and b = (6i, ■ ■ ■ , 6„) G C" (see [I2|)- If each 
Tjk is perturbed by a constant a^. G C, then ^^(b) does not change, i.e. 

n 

(2.3) P,(b) = sup P(| V bkiVk - «fc) -v\<e), 

tit 

We first establish a small ball probability for big e via central limit theorem 
for complex r.v.'s rji, - ■ ■ ,rin. Before we state the next result, let us introduce 
some more notation and terminology. Re{z) and Im{z) will denote the real and 
imaginary part of a complex number z. Write rjik = Re{rik), r]2k = Ifn{f]k)-, 
^1 = ^ik = ^iVik - Erjikf, al = a^f. = E{r]2k - Er]2kY, (J12 = cim = 
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E{7]ik - Er]ik){r]2k - E7]2k) for /c = 1,2, •■■ ,n. For real r.v.'s ^ and r/, if 
{E{i - EC){v - Er])Y = E{i - EifE{r] - Erif > 0, then we will say that ^ 
and T) are linearly correlated. 

Theorem 3. Let rji, - ■ ■ ,r]n he i.i.d. complex r.v. 's with variances at least 1, 
E\rji\^ < B and let bi, - ■ • ,bn be complex numbers such that < Ki < \bk\ < 
K2 for all k. Then for every £ > 0, 



where C is a finite constant depending only on B, (Ji, (72 and au- 

Proof. Suppose first that Re{r}k) and Im{r]k) are linearly correlated, k — 
1, ■ ■ ■ ,n. Then i]k - Er]k = ^fc(l + ibo)/{l + blf'^ a.s., whcrc^ = (1 + 
b'^y/'^Reirjk — Er]k) and bo is an absolute real constant. Write bk = bk{l + 
ibo)/{l + ^o)^^^ which satisfies Ki < \bk\ < K2. Let bik = Re{bk) and 
b2k = I^ipk)- Noting that 

n n 

supP(|5„-i;| <£) < supP(| V6ifc^fc-i?e(T;)| < £, \y^b2k^k - Im{v)\ < e) 

vec vec 

and either Ylk=i^ik — or Ylk=i^2k ^ '"'-^1/2, we can complete the 

proof for the linearly correlated case by Berry-Esseen inequality. 

The case where Re{r]k) = or Im{r}k) — a.s. follows from Berry-Esseen 
inequality directly. 

Now suppose Re{r]k) and Im{rjk) are not linearly correlated, and P(^Re{r]k) = 
0) < 1, P{lm{r]k) = 0) < 1. Let bk = bik + ib2k and v = vi + iv2. Define 

n 

rjik = bikr]ik-b2kr]2k and 772^ = bik^k + hk^lik- Obviously, ^ E\fijk- Efijk\^ < 

k=l 

n n 

E E\bk{r]k - Er]k)f < 8B\\b\\lj = 1,2, where = ^ \bkf. In order to 
fc=i fc=i 
apply Berry-Esseen inequality, we need to get a lower bound for E\fjjk — Efjjk\'^. 
For J = 1, we have 

|2 



E\fiik - Efjiki 
bkn{\bikWik/\bk\ - \b2kW2k/\bk\ 



b\k^lk + ^2fc'^2fe - 26ifc62feO-12fe 



+2|6ife62fe||&fe| ^((7ife(72fe - sign(6ife&2fe)(7m)y 



For t e [0, 1], let fit) = [tai - y/T^a2f + 2t^/T^{ala2 ± a^)- So the 
smallest value a — mintg[o,i] f{t) of f{t) in [0, 1] is attainted at or 1 or some 
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to £ (0, 1). Therefore, a is a positive constant depending only on ai, a2 and 
ai2. Hence E\T)ik - Efiik\^ > a\hk\^. Similarly, E\fi2k - ET)2k\^ > a|6fcP- By 
Berry-Esseen inequality, one can then conclude that 



(2.5) sup P(| J2iVik - Ev,k) - ^il < 4) < ^ + C f 

DieM ^ V2 ||b||2 V llhl 



3 



and 



3 



" / II V> 

(2.6) sup P(| ^(r)2. - Efj2k) -^2\<^)<T^ + C( i- 

where C is a constant depending only on 5, o"i, o"2 and o"i2. 
Thus ( 12. 4p follows from (12. 5p . (12. 6p and the following inequality 

n 

e 



3 



s\ygP{\Sn- v\ < e) < sup P{\y2iVik - Efjik) - vi\ < 
Dec Vim f~ v2 



k=l 

n 



+ sup P(| ^(^72^ - ^??2fc) - t^2| < A=)- 

-"2^ ^ V2 

□ 

Theorem [3] only yields a polynomial rate n~^/^. Next, an improved small 
ball probability is needed for our future use. To this end, some concepts will 
be presented which are parallel to those of [T8] . 

Denote the unit sphere in C" by 5*""^. 

Definition 1. Let a G (0, 1) and r > 0. The essential least common denom- 
inator of a vector b G C", denoted by D{h) = D^^ri^), "is defined to he the 
infimum oft>Oso that all coordinates of the vector th are of distance at most 
a from nonzero integers except r coordinates. 

Definition 2. Suppose that 7,p G (0,1). A vector b G C" sparse if 
\supp{h)\ < 7?2. A vector h G S"^^^ is compressible if h is within Euclidean 
distance p from the set of all sparse vectors. All vectors b G S"'~^ except com- 
pressible vectors are called incompressible. Let Sparse = S par se{''y), Camp = 
Compij, p) and Incomp = Incompipf, p) denote, respectively, the sets of sparse, 
compressible and incompressible vectors. 

Definition 3. For some Ki,K2 > 0, the spread part of a vector b G C" is 
defined as 

h = {y/nhk)k(ia(h), 
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where the subset a(b) C {1, ■ ■ ■ ,n} is given by {k : Ki < y/n\bk\ < K2}. 
Similarly, for j = 1,2, define 

hj = {^/nbjk)kea{h), |bj| = {^/n\bjk\)kea{h), |b| = {Vn\bk\)kea{h), 
where bik and b2k denote, respectively, the real part and imaginary part of bk ■ 

Similar to the real case, the complex incompressible vector are also evenly 
spread, i.e. many coordinates are of the order ri"^/^. 

Lemma 1. Let h G Incomp{'y, p) . Then there is a set cri(b) C {1, ■ ■ ■ ,n} o/ 
cardinality |o"i(b)| > cn with c > so that for j = 1 or 2, 

(2.7) — < \b,k\ < ^— for all k G ai(b). 

2V2n Jin 



Proof. By Lemma 3.4 in [18], for b G Incompd, p), there is a set (T(b) of 
cardinality |cr(b)| > ^p'^'jn so that 

^ < \bk\ < for all k G a{h). 



'2n Jin 

Hence < l/^/yfi and \b2k\ < ^l\fin if /c G (T(b). On the other hand, 
either b\k or b2k must be bigger than p(2-\/2n)~^. The assertion follows. □ 

The following result refines Theorem [3l 

Theorem 4. Let b = ■ ■ ■ , 6n) £ whose spread part b is well defined 
(for some fixed truncation levels Ki, K2 > 0). Suppose < a < K1/6K2 and 
0<P < 1/2. 

(1) Suppose that rji,--- , //„ are i.i.d. real r.v.'s, or imaginary r.v.'s, or 
complex ones with linearly correlated Re{rjk) and Im{r]k),k = 1,2, ■ ■ ■ ,n. If 
E\T]k — Erjkl"^ = 1 and E\rik\^ < B, for any e >Q, then 

(2.8) Peih) <-^{e + \ . I +Cexp(-ca^/3n), 

VP \ ynmax{Dc,,/3„(bi),L'„,^„(b2)}y 

where C, c > depend only on B, Ki, K2. 

(2) Let Tji, - ■ ■ , T]n be i.i.d. complex r.v. 's with E\r]k — Er]k\'^ = 1 and E\r]k\^ < 
B, then l{2.8\) holds or 

(2.9) Pe(h) <-^le + ^ I + C exp(-ca'^/3n) 

where C,c> depend only on B, Ki, K2, o"i, cr2 and cx^. 
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Proof. Since -Pe(b) = sup P{\Sn — ESn — v\ < e), we can assume that Erjk = 0. 

(1) . We only consider the case where the r.v.'s {rjk} are reaL The other two 
cases follow from the real case. Let bj. = hik + ih2k and v = Vi + iv2- Noting 
that 

sup P(\Sn — v\ < e) 

n n 

< min ( sup P(| Y^feifc^/fc - Vi\ < e), sup P(| ^^62^^?/^ - < e)) 

Then Corollary 4.9 in [H] leads to ([21]). 

(2) . For the moment we assume that 

l<|&fc|<^ for all k. 

Let bk = bik + ib2k, Vk = Vik + 'iV2k and v = vi + iv2- It is observed that 
Theorem [3] implies Theorem H] for big values of e (constant order or even 
larger). Therefore we can suppose in what follows that 

e<li, 

where li is a constant which will be specified later. 

If the real part of rji is linearly correlated to the imaginary part of r]i , then we 
have fl2.8p . Therefore we assume in the sequel that rju is not linearly correlated 
to r]2i. 

Set Cfc = j^l^fc - ^fcl where = bikVik - b2k'n2k and is an independent 
copy of ^fc. Then 

(2.10) IeCI = T^E\b^kVlk-b2kV2k\'. 

As in the proof of Theorem [3] 

ECl > 2a > 0, 

where a is some positive constant depending only on ai, a2 and cri2. 

On the other hand, E(^ < 64B. The Paley-Zygmund inequality (p^) gives 
that 

which is a positive constant depending only on B, ai, (T2 and a^. Following 
[TS j we introduce a new r.v. (k conditioned on (k > v^; ^^at is, for any 
measurable function g 

^ Eg{Ck)m > VE) 
Eg{C,k) = ^ — 1=\ — , 
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which entails 

(2.11) Eg{Ck)>(3Eg{Ck)- 
From Esseen inequahty, one has 

n 

P,(b) < supPd^^fc-^il 



k=l 

(2.12) < C r" m/e)\dt, 

J --Kjl 

where 

n 
k=l 

With the notation (j)k{t) = Eexp{i^kt), it is observed that 

\Mt)\^ = Ecos{\bk\Ckt), 

and we then have 



\m<ll^w{-li^-\Mt)\i 

k=l ' 

= exp ( - E 5^ -(1 - cos(|6fe|at))) = exp ( - Eg{C 



k=l 

where 



g{t):=J2^^'{l\bk\t). 



■2' 

k=l 



This, together with (12.111) . gives 

m)\<exp{-PEg{Ckt)). 
Consequently, fl2.12p becomes 

Peih) < C exp ( - /3 Eg{C,,,t/e)) dt 



<CE [ exp{-(3g{at/e))dt 

.7T/2 

(2.13) < C sup / exp ( - (3g{zt/e)) dt. 

z>\/a J—n/2 



z>_y/o, J— 7r/2 

Let 



M := max qizt/e) = max > sm'^(\bk\zt/2e) 

k=l 



12 
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and the level sets of g be 

T{m,r) := {t : \t\ < r, g{zt/e) < m}. 
As in [THj, one can prove that 

n 

- < M <n, 

by taking e < {'^^^/a)/4 = li. All the remaining arguments including the 
analysis for the level sets T{m,r) are similar to those of |18J and so we here 
omit the details. Thus, one can conclude that for every e > 

(2,14) |P.(b)|<^(. + ^^)+Cexp(-^^), 

where < r < n, \h\ = - , |&„|) and C,c > are positive constants 

depending only on B, (Xi, (J2 and (Tu-- 

Finally, combining fl2.14p and Lemma 2.1 in [T8] one can obtain the small 
ball probability for complex case (when applying fl2.14p to the spread part of 
the vector b one can suppose that -ft^i = 1 by re-scaling bk and a). Thus we 
complete the proof. □ 

To treat the compressible vector, the following lemma is needed. 

Lemma 2. Suppose that rji,--- jfjn are i.i.d. centered complex r.v.'s with 
E\irik\^ = 1 and E\r]k\^ < B. Let {ajk,j,k = 1, ■ ■ ■ ,n} be complex numbers. 
Then for < A < 1 and any vector b = (6i, • • ■ , 6„) G S"'~^ there is fi & (0, 1) 

n 

such that the sum Snj = X] ^fc(% ~ (^jk) satisfy 

k=l 

P{\Snj\ > X) > 

where depends only on X and B. 
Proof. Simple calculation indicates that 

n 

ElSnjl"^ = I bkajkl"^ + 1. 

k=l 
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On the other hand by Burkholder inequahty we have 

n n 

k=l k=l 
n n 

< c{\Y,bka,,\' + (Yl \bk\'E\r^k\Y' + E l^^l'^l 

k=l k=l k=l 

< C (\J2bkajk\^ + l + B 

\ k=l 

Hence Paley-Zygmund inequahty gives that 

P(|5„,| > A, > > 



Vk] 



where 

n 

Cnj I ^ ^ bkCljk I 
k=l 

Take 

(J2 + 1 _ A2»3 



Then one can conclude that 

/i := min f{t) > 

ie(0,oo) 

and then 

p{\Snj\ > x)>^l>o 

where /x depends only on A and B. □ 

2.2. Proof of Theorem [2]. The whole argument is similar to that of [L8\ and 
we only sketch the proof. For more details one can refer to [TS] . 

Since S"~^ can be decomposed as the union of Comp and Incomp, we then 
consider the smallest singular value on each set separately. 

By Lemma [2] there are Ci > and v G (0, 1) depending on /i only so that 

P(||Wb||2 < civ^) <v\ he S"-\ 

Actually, the proof is similar to that of Proposition 3.4 in [H]. The only 
difference is that we should use our Lemma [2] instead of Lemma 3.6 in [T3j. 
Therefore similar to Lemma 3.3 in \I8\, we have, there exist 7,p, C2,C3 > so 
that 

(2.15) P( inf ||Wb||2 < C2n^^^) < e"^^" + P(||W|| > Kn^^'^), 

be Comp (7, p) 



14 GUANGMING PAN AND WANG ZHOU 

where K > 1. 

Let Xi , • • • , X„ denote the column vectors of W and Hj. the span of all 
columns except the k-th column. One can check that Lemma 3.5 in [TS] is still 
true in complex case and hence 

1 " 

P( inf \\Wh\\2<epn-^/'^) < — V P(dist(Xfc, i/fc) < e) 

be/ncomp(7,p) 'jn 

1 " 

(2.16) < — VP(|(Yfc,Xfc)| <£), 

where is any unit vector orthogonal to Hk and can be chosen to be inde- 
pendent of Xfc. Here (■, ■) is the canonical inner product in C^. 

When all {Xjk} are real r.v.'s, or when Re{Xjk) and Im{Xjk) are linearly 
correlated or when Re{Xjk) = we have 

P(| (Yfc, Xfc) I < e and Ur) < P{Yk E Comp and Uk) 

+P(|(Yfc,Xfc)| < £, Yfc G Incomp and Uk), 

(2.17) 

where Uk denotes the event that ||W|| < Kri^f^. One can check that Lemma 
3.6 in |18] applies to complex case and hence 

P(Yfc G Comp and Uk) < e""*'', 

where C4 is a constant depending only on P, ai, 02 and a\2- Further, 

P(|(Yfc,Xfc)| < e, Yfc G Incomp and Uk) 
2 

- 5Z ^ (^J'^' ^a,/3n(Yjfe) < e"^" and Y^ G Incomp^ 

2 

+ ^P [/(P,,^„(Y,fc) > e'^" and Y^ G Incomp)P {\{Yk,Xk)\ < e\Y, 

-)■=! 



where Vik and V2A,. denote, respectively, the events that the real part and imag- 
inary part of the vector Y^ G Incomp satisfy (12. 7p in Lemma [11 Yi^ and 
denote, respectively, the spread part of the real part and imaginary part of the 
vector Yfc. By (12.81) in Theorem |4] and (12. 3p we have 

l{D^,f3n{Yjk) > e^")P(|(Yfc,Xfc)| < e\Y,) < cge + cee"'^^", 
where C5, cq, C7 are positive constants depending only on P, ai, a2 and ai2- 
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On the other hand, 



P {V,,^ f/^, /^„,,n(Yi,) < and Y, e Incomp 
< 5^P(YfcG^^, t/^andVi,). 

Here the level set So ^ S*""^ is defined as 

Sd := {Yfc G /ncom^j : D < Do,,no/2(Yik) <2D}. 

and 

V = {D: Do<D< e"\ D = 2^ ke Z}, 

where a and Dq are some constants. For more details about a and Dq, see 
|18j . Further, one can similarly prove that Lemma 5.8 in [18] holds in our case 



and therefore we obtain 

P(Yfc G Sd and Uk) < e"", 

which, combined with the fact that the cardinal number \V\ is of order n, then 
implies that 



P (Vik, Uk, D^^pni^ik) < e"^ and Y^ G Incomp^ < e 
where cs > 0. Similarly, one may also show that 

P {V2k, Uk, D^^pn{^2k) < e'" and Y^ G Incomp^ < e"^*". 
Picking up the above argument one can conclude that 

P(|(Yfe,Xfc)| <eand ||W|| <Kn^l^) <Ce + e-^''', 
which further gives that 

(2.18) P{ inf ||Wx||2 < £pn~i/2^ < ^(e + c") + P(||W|| > 

Incomp {■y,p) 

where C > and c G (0, 1) depend only on K, B, ai, and a\2- 

For all the remaining case, i.e. Re{Xjk)Im{Xjjs) ^ 0, and Re{Xjk), Im{Xjk) 
are not linearly correlated, one has 

P(|(Y,,X,)| < e and U^) < P{D^M\^k\) < e^" and Uk) 



+E 

(2.19) 



/(D,,^„(|Yfc|) >e^")P(|(Yfc,Xfc)| <£ 



Yi 



and one can similarly obtain (12.181) for complex case. Theorem [2] follows from 
(l27[5|) - fl2A9|) immediately. 
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3. The convergence of logarithmic potential and circular law 

In this part the logarithmic potential will be used to show that the circular 
law is true. According to Lower Envelop Theorem and Unicity Theorem (see 
Theorem 6.9, p. 73, and Corollary 2.2, p. 98, in [19]), it suffices to show that 
the corresponding potential converges to the potential of the circular law. 

To make use of Theorem [2] one needs to bound the maximum singular value 
of W. To this end, we would like to present an important fact which was 
proved in [25], that is, if (1) EXjk = 0, (2) \Xjk\ < Vnen, (3) E\Xjk\^ < 
1 and 1 > E\Xjk\'^ 1 and (4) E\Xjk\^ < ci^SnY-^ for / > 3, where e„ ^ 
with the convergence rate slower than any preassigned one as n — > oo. Then 
for any K > 4 

(3.1) P(||XX*|| > Kn) = o(n"'), 

where / is any positive number (proved for real case in [25], for complex case 
see Chapter 5 of [2]). 

Let the random matrix X = (Xjk) with Xjk = XjkI{\Xjk\ < ^/nSn)- Then 
one can show that 

(3.2) P(X 7^ X,io.) = 0, 

see Lemma 2.2 of [25] (the argument of the complex case is similar to that of 
the real one). Here the notation i.o. means infinitely often. Thus it is sufficient 
to consider the random matrix X in order to prove the conjecture. 

Taking A„ = iJX — zs/nl in Theorem [2] one can obtain that 

(3.3) P(s„,(X - z^l) < en-^/2) < Ce + + P(||X - z^/^I\\ > Kn^'^), 

where i?X = (EXkj)- Here one should note that from (13.31) re-scaling the 
underlying r.v.'s is trivial. Moreover 

E\X 1^ 

\\E±-zy/nI\\ < k|v^+ " . 

Therefore, applying (13. ip and choosing an appropriate K in (13.31) . we have 

(3.4) P(s„(X - z^I) < en-^l'^) < + c" + n"' 

where both C > and c G (0,1) depend only on K, P|Xii|^, P(Pe(Xii))^, 
P(Jm(Xii))^ and PPe(Xii)Jm(Xii). 

In the sequel, to simplify the notation, we still use the notation X instead 
of X and /i„(x, y) instead of the empirical spectral distribution corresponding 
to X. But one should keep in mind that {X^j] are non-centered and \Xkj\ < 
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Let 

for each z = s + it ^ C Here (■)* denotes the transpose and complex conjugate 
of a matrix. Let Vn{x,z) be the empirical spectral distribution of Hermitian 
matrix H„. 

Before we prove the convergence of the logarithmic potential of ^n{x, y), we 
will characterize the relation between the potential of the circular law n{x, y) 
and the integral of logarithmic function with respect to v{x,z), the limiting 
distribution of z) as below. 



Lemma 3. 




1 1 

log- -dfi{x,y) = — / \ogxv{dx,z). 

\x + iy-z\ 2 Jo 



Proof. Let x + iy = re*^, r > 0. One can then verify that 



/7r 
logl^-re^V^ 
-TT 



2tc logr if \z\ < r, 
2tc log \z\ if \z\ > r. 



It follows that 

1 , . . {2~^{l - kP) if \z\ < 1 




(3.6) / / log I — — Mx,y) , , I I -r I I 1 

\x + iy~z\ -log|z| if |z| > 1. 



On the other hand by Lemma 4.4 in [T] one has 

d_ r 

ds Jo 

where 



logxv{dx, z) = g{s,t), 



2s 



9{s,t) 



if S2 + t2 > 1 



2s otherwise. 



Therefore for any z = s + it, Zi = Si + it with > 1, we have 
(3.7) 

'0 poo ps 

\ogxv{dx,z)— / logxf (dx, Zi) + log = / g{u,t)du + log \zi\ 



Sl 



Let Sl — i> oo and then \zi\ oo. Therefore, from Lemma 4.2 of the left 
and right end point, Xi and X2, of the support of v{-,Zi) satisfy 



y^ = l + 0(l), J = l,2, 

ki 
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which imphes that 



I \ogxv{dx, zi) —\og\zi^ = I log 
as Si oo. In addition, 

(3.8) I g{u,t)du + \og\zi\^ = 

J SI 

Thus Lemma [3] is complete. 



X 



^1 



:v{dx, Zi) 



if|z|<l 
log bp if \z\ > 1. 



□ 



We now proceed to prove the convergence of the potential of fin{x,y). The 
potential of /i„(a;, y) is 



--log 
n 

poo 



det (n-^/^X - zl) 
det(H„) 



(3.9) = -- / \ogxVnidx,z) 

^ Jo 

where I is the identity matrix. We will prove 



log XVn{dx, z) 



log XV {dx, z) 



'0 JQ 

as n oo. Observe that by the fourth moment condition 



Amax(H„) < 2(A„,ax(ri"'XX*) + \z\ 



2\ 



+ 2\z\ 



where Ainax(H„) denotes the maximum eigenvalue of H„. It follows that for 
any 5 > and sufficiently large n 

oo 

log x(f„((ix, z) — v{dx, z)) I 

-4-2S 

8+2|2|2+5 

\ogx(^Vn{dx, z) — v{dx, z)) I 

-4-2(5 

< (I log(n-^-2^)| + log(8 + 2|^p + 6)) \Mx, z) - v{x, z)\\ 
0. 

Here we do not present the proof of the convergence of Vn{x,z) to v{x,z) 
with the desired convergence rate for each z. Indeed, the rank inequality (see 
Theorem 11.43 in [2]) can be used to re-centralize Xjk and then Lemma 10.15 
in [2] provides the convergence rate under the assumption £'|Xiip+'^ < oo. 
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On the other hand, by (13.41) and Borel-Cantelh lemma, 
1 



log 



det(H, 



Here we take e = n^^^^, 5 > in (13. 4p . One should observe that e in Theorem 
5.1 in [18] can be dependent on n, so does e in Theorem [2l Moreover, from 
Lemma 4.2 in fl] one can conclude that 



/ \ogxv{dx, z) — >• 0. 
Jo 



Therefore 

(3.10) / iogxt;(da;,;z). 

2 Jo 

Again by the fourth moment condition 

|Ai(X)|< (A^,>-iXX*))'/'^2. 

So for all large ra, almost surely /i„ is compactly supported on the disk {z : \z\ < 
2 + 5} . Here we have used the fact that all the eigenvalues of an n x n matrix 
are dominated by the largest singular value of the same matrix. Consequently 
Theorem [T] follows from Lemma [3] combined with Lower Envelop Theorem and 
Unicity Theorem for logarithmic potential of measures (see Theorem 6.9, p. 73, 
and Corollary 2.2, p.98, in [19j). 
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